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"on"; 

Abstract. If the equation of the title has an integer solution with k > 2, then 
^ ! m > 10 9 - 310 . This was the current best result and proved using a method due 

to L. Moser (1953). This approach cannot be improved to reach the benchmark 
m > 10 10 . Here we achieve m > 10 10 by showing that 2k/ (2m— 3) is a convergent 
of log 2 and making an extensive continued fraction digits calculation of (log 2) /N, 
with N an appropriate integer. This method is very different from that of Moser. 
Indeed, our result seems to give one of very few instances where a large scale 
^ I , computation of a numerical constant has an application. 

c3 ! 1. Introduction 

i "i 1 In this note we are interested in non-trivial integer solutions, that is, solutions 

with k > 2, of the equation 

> : l fc + 2 fc + --- + (m-2) fc + (m-l) fc = m fc . (1) 

Conjecturally such solutions do not exist. For k = 1 one has clearly the solu- 
tion 1 + 2 = 3 (and no further ones). From now on we will assume that k > 2. 
Moser [28] showed in 1953 that if (m, k) is a solution of ([T]), then m > 10 1()6 and k 
is even. His result has since then been improved on. Butske et al. [6] have shown 

Q\ ', by computing, rather than estimating, certain quantities in Moser's original proof 

that m > 1.485 • lfj 9321155 . By proceeding along these lines this bound cannot be 
improved on substantially. Butske et al. [6, p. 411] expressed the hope that new 

^ ■ insights will eventually make it possible to reach the more natural benchmark 10 10 . 

Using that E fc (m) = l k + 2 k + ■ ■ ■ + m k < jf™ t k dt and E fc (m + 1) > J™ t k dt we 
obtain that k + 1 < m < 2{k + 1). This shows that the ratio k/m is bounded. 
By a more elaborate reasoning along these lines Krzysztofek |20j obtained that 
k + 2 < m < |(& + 1). This implies that k > 4 and hence 

k + 2 <m <2k. (2) 

Dividing both sides of ([I]) by m k one sees that for every integer m > 2, (0D) has 
precisely one real solution k. It is known that linv^oofc/m = log 2 and we show 
here that in fact the behaviour of k as a function of m can be determined in a much 
more explicit way (Theorem [1] and Section [2]). 

Moree et al. [27] . using properties of the Bernoulli numbers and polynomials (an 
approach initiated in Urbanowicz [30]), showed that Ni = lcm(l, 2, . . . , 200) | k. 
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Kellner [19] in 2002 showed that also all primes 200 < p < 1000 have to divide k. 
Actually, Moree et al. [27J p. 814] proved a slightly stronger result and on combining 
this with Kellner's, one obtains that N 2 \ k with 

N 2 = 2 8 • 3 5 • 5 4 • 7 3 • ll 2 • 13 2 • 17 2 • 19 2 - | [ p > 5.7462 • 10 427 . 

23<p<997 

For some further references and info on the Erdos-Moser equation we refer to the 
book by Guy [H D7]. 

In this note we attack ([I]) using the theory of continued fractions. This approach 
was first explored in 1976 by Best and te Riele [3] in their attempt to solve the 
related conjecture of Erdos [TTJ that there are infinitely many pairs (m, k) such that 
Sfc(m) > m k and 2(m — l) k < m k . In this context they also gave the following 
variant of one of their results (without proof), namely, ([3]) with 0(mr 2 ) replaced 
with o(m^ 1 ). The proof we give here uses the same circle of ideas as used by Best 
and te Riele. It seems that after their work continued fractions in the Erdos-Moser 
context have been completely ignored. We hope the present paper makes clear that 
this is unjustified. 

Theorem 1. For integer m > and real k > satisfying equation ([!]), we have the 
asymptotic expansion 

k = \og2\m - + o( — r ) ) asm ^ oo, (3) 

\ 2 m \m 2 J J 

with ci = f| - 3 log 2 « 0.00389 .... Moreover, ifm> 10 9 then 

h log2( 1 - — - % J , where < C m < 0.004. (4) 



m 



2m m? 



Corollary 1. If (m,k) is a solution of with k > 2, then 2k/ (2m — 3) is a 

convergent pj/qj o/log2 with j even. 

Corollary 2. The number of solutions m < x of (pQ) ; as x tends to infinity, is at 
most O(logx). 

The equation ([T]) seems to be a sole example of an exponential Diophantine equa- 
tion in just two unknowns for which even the finiteness of solutions is not yet estab- 
lished. The best result in this direction is given by Corollary^ which is an immediate 
consequence of the exponential growth of J)j cLS cL function of j and Corollary [TJ 

Corollary [T] is not the only result which relates convergents to solutions of Dio- 
phantine equations. For example, if (xo,yo) is a positive solution to Pell's equation 
x 2 — dy 2 = ±1, with d a positive square- free integer, then x /y is a convergent of 
the continued fraction expansion of yd. On the other hand, in our situation the 
number in question, log 2, is transcendental and its continued fraction expansion is 
expected to be sufficiently 'generic' (unlike that of quadratic irrationals). 

Corollary [1] naturally leads us to investigate common factors of k and 2m — 3. 
This can be done using the method of Moser, but is not in the literature, as before 
there was no special reason for considering 2m — 3. 
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A key role in this arithmetic study is played by the congruence 
Ylf=if _ f (mod |) if r > 1 is odd; 

V \-E P |i,p-i|r p ( modl ) otherwise. 

This identity can be proved using the Von Staudt-Clausen theorem; for alternative 
proofs see, e.g., Carlitz [7J or Moree [25]. Its relevance for the study of (CQ) was first 
pointed out by Moree [26] . 
Given N > 1, put 

V(N) = {p : p — 1 | N} U {p : 3 is a primitive root modulo p). 

By a classical result of Hooley [16] it follows, assuming the Generalized Riemann 
Hypothesis (GRH), that V(N) has a natural density A, with A = 0.3739558136 . . . 
the Artin constant, in the set of primes. If 2k/ (2m — 3) = Pj/qj is a convergent of 
log 2 arising in Corollary [TJ then it can be shown that (oj, 6) = 1 and, if p G V(N 2 ) 
and p divides qj, then 

v v (q 3 ) = v p {2T x - 1) + 1 > 2, 
where we write v p (n) = a if p a | n and p a+1 \ n. All primes p < 2017 are in 
V(N2). For p ^ 3 we have z/ p (3 p_1 — 1) = 1 unless 3 P_1 = 1 (modp 2 ), that is, p is 
a Mirimanoff prime. (It is known that the only Mirimanoff primes p < 10 14 are 11 
and 1006003.) 

The main idea of this paper is, in essence, to make use of the fact that the 
convergents Pj/qj of log 2 have no reason to also satisfy N 2 \ Pj. The first piece of 
information comes from asymptotic analysis and the latter piece from arithmetic. 
Analysis and arithmetic give rise to conditions on the solutions that 'do not feel 
each other' and this is exploited in our main result: 

Theorem 2. Let N > 1 be an arbitrary integer. Let 
log 2 



[Oq, Ql, 0,2, ■ ■ ■] — CLq + 



2N 1 

ai 



a 2 H 

be the (regular) continued fraction of (log2)/(2iV), with pijqi = [ do, Oil, ... , Oj its 
i-th partial convergent. 

Suppose that the integer pair (m,k) with k > 2 satisfies ([I]) with N | k. Let 
j = j(N) be the smallest integer such that: 

(a) j is even; 

(b) a j+1 > 180N - 2; 

(c) (qj, 6) = 1; and 

(d) v p (qj) = f p (3 p_1 — 1) + v p (N) + 1 for all primes p G V(N) dividing qj. 
Then m > oj/2. 

Computing many partial quotients (that is, continued fraction digits) of log 2 
is closely related to computing log 2 with many digits of accuracy. Indeed, it 
is a well-known result of Lochs that for a generic number knowing it accurately 
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up to n decimal digits implies that we can compute about 0.97n (where 0.97 ~ 
6(log2)(log 10)/7r 2 ) continued fraction digits accurately. For example, knowing 1000 
decimal digits of tt allows one to compute 968 continued fraction digits. 

It seems a hopeless problem to prove anything about E(logg.,-(jv)), the expected 
value of loggj(jv) produced by the result. However, metric theory of continued frac- 
tions offers some hope of proving a non-trivial lower bound for E(loggj(jv)(C))> where 

and (d) to be satisfied but replace (log 2)/(2N) 



we require conditions 
by a 'generic' £ G [0, 1] \ 
such a £, 



(b) 



In this context recall the result of Levy [21] that, for 



lim 



7T 



1.18. 



(6) 



3-«x> j 12 log 2 

The Gauss-Kuz'min statistics asserts that, for a generic £, the probability that a 
given term in its continued fraction expansion is at least b, equals log 2 (l + 1/b). 



This allows one to deal with the case where we only have condition (b) Likewise a 
result of Moeckel [23] , reproved in a very different way a few years later by Jager and 
Liardet [IT], allows one to deal with the case where we only focus on condition 
Their result says that for a generic £ G [0, 1] \ Q we have 



lim 

n— >oo 



{1 < m < n : q m (^) = a (mod d)} d (p((a,d)) 
n J(d) (a, d) 



where (p denotes Euler's totient function, J(m) = m 2 Yl p \ m {l — 1/p 2 ) Jordan's totient 
and (a, m) the greatest common divisor of a and m. This result shows that (qj, 6) = 1 
with probability 1/2 (note that a natural number is coprime to 6 with probability 
1/3). P. Liardet communicated to us that methods of his paper [22] can be used 
to take into account both conditions (a) and (c) ; also the authors of [15] claim that 



this can be done. We expect that there is a positive constant c\ such that for a 
generic £ satisfying conditions (a), (b) and (c), we have E(log<7j(jv)(£)) ~ C\N as N 



(b) 



and 



tends to infinity. Furthermore, we expect that for a generic £ satisfying conditions 
(d) , E(log<7j(jv)(0) ~ C2N\og f3 N for some positive constants C2 and 
is responsible for N, condition (d) for log' 3 N, while conditions 



j3; condition (b) 
and 



c ) | affect C2- We are definitely not experts in metric aspects of number theory, 
thus leave this problem to the interested reader acquainted with the subject. Indeed, 
we even expect that going beyond computing the expected value of \ogqj(N)(£) is 
possible, and a probability distribution for log qj^){0 can be obtained. 

Using the above results from metric theory of continued fractions and some heuris- 
tics we are led to believe that roughly speaking we can get 

m > 10 257N 

from Theorem [2l Being able to compute the convergents of (log 2) / (2N) arbitrarily 
far, we would expect (taking iV = iV 2 ) to show that m > 10 10 . With the current 
computer technology computing sufficiently many convergents is the bottleneck. 
Taking this into consideration we would expect to get 



m > 10 



0.515r 
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from Theorem El where r is the number of convergents we can compute accurately 
and 0.515 is the base 10 logarithm of Levy's constant ([6]). Note that the fact that 
N2 has many divisors gives us some flexibility and increases the likelihood of the 
heuristics to be applicable. Indeed, our numerical experimenting agrees well with 
our heuristic considerations (see Section Hj). Early 2009, A. Yee and R. Chan [3]] 
reached r > 31 • 10 9 for log 2. On the other hand, Y. Kanada and his team [18] 
computed ir to over 1.24 trillion decimal digits already in 2002, using formulae of 
the same complexity as those used for the computation of log 2 (see [21 Chapter 3] 
for details). Thus, given the present computer (im)possibilities, one could hope to 
show (with a lot of effort!) that m > 10 1()12 . 

Applying Theorem El with N = 2 s ■ 3 5 ■ 5 3 or iV = 2 s ■ 3 5 • 5 4 , and invoking the 
result of Moree et al. [27] that N \ k, we obtain the following 



Theorem 3. If an integer pair (m, k) with k > 2 satisfies ([!]), then 

m> 2.71 39 -10 1 667658416 . 

As an application we can show that uj{m — 1) > 33, this improves on the result 
of Brenton and Vasiliu [5], who have shown that uj(m — 1) > 26, where ui denotes 
the number of distinct prime divisors; see Section 15.11 for further details. 

The fact N2 \ k naively implies that k is of size 10 427 (at least), which is much 
smaller than Moser's 10 10 . However, in this paper we show that the fact actually 
yields that k > 10 10 (and likely even k > 10 10 ) — a modestly small number 
dividing k leads to a huge lower bound for k. Thus, on revisiting [27] after 16 years, 
its main result is seen to be far more powerful than the second author thought at 
that time. 

In the three following sections we prove Theorems (U El and [3j respectively. Our 
final Section [5] is devoted to discussing some problems related to the Erdos-Moser 
equation. 

2. Asymptotic dependence of k in terms of m 
Our proof of Theorem [1] makes use of the following lemma. 
Lemma 1. For any real k > 0, we have 

(l-y) fc = e^(l-^ 2 -^ 3 + ^^y 4 + ^^y 5 + 0(^ wy-0. (7) 

Moreover, for k > 8 and < y < 1, the inequality 

e . ky ^ _ k y2 _ k_ y , + Hk_2l y4 + H^zV y5 _ ^ 

< (i - yf < e~ ky ( y + + %y h 



2 



holds. 



Proof. As for the asymptotic relation in (J7J , we simply develop the Taylor expansion 
of (1 — y) k e ky up to y 5 . Unfortunately, estimates coming from the classical forms 
for the remainder are not sufficient to derive a sharp dependence on k as in ([8]) for 
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the last term. Therefore, we need more drastic methods to quantify the asymptotics 
in (J7J) when < y < 1. 
First note that 

oo 1 

(i-2/K = i-E^ n 

n=2 

y 2 y 3 y A y 5 



• ■ ■ , o < y < 1. 

2 3 8 30 y 

Since all coefficients, starting from n = 2, in this power series are negative and their 
sum is exactly —1, for these values of y we have the inequality 

l- y -- y -- y -- y --^<(l-y) e y<l- y -- y -- y -. (9) 

2 3 8 30 120 v ; 2 3 8 w 

The quantities 

y 2 . y 3 . y A A y 2 . y 3 . y 4 . y 5 . y 6 , im 

xi — and x 2 = , 10 

2 3 8 2 3 8 30 120' V ; 

which appear in lie between and 1 for < y < 1. 

Our next ingredient is Gerber's generalization of the Bernoulli inequality [12] (see 

also Alzer pQ). It states that the remainder after k terms of the (possibly divergent) 

binomial series for (1 + x) a (a,x real with —1 < x) has the same sign as the first 

neglected term. In particular we have for real k > 2 and < x < 1, 

{l-x) k <l-kx+ k{k ~ l) x\ (11) 

and for real k > 3 and < x < 1, 

, k(k-l) 9 k(k- l)(fc-2) , , . 

(l-x) k >l-kx + — -x 2 -^ ^ J -x 3 . 12 

v ; 2 6 V ; 

Using the right inequality in Q and taking x = X\ in fllip we obtain, for k > 2, 

(i-,)^<i-^^ + ^ + ^+ M ^ 11 f-+-+- x2 

v yj V 2 3 8 / 2 V 2 3 8 

= i-\y 2 -ly 3 + k -^j^y A 



<l-^-^ + ^), 4+ ^_ 1)y5 (13 ) 

implying the upper estimate in (jSj). In the same vein, the application of the left 
identity in and of (TT2"j) with x = x 2 results, for k > 3, in 

(i - y) k e ky >^~\v 2 - \v* + ^f^y 4 + ^^y 5 

12 

-ky 6 J2( a nk 2 + b n k + c n )y n , (14) 

n=0 
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where the polynomials p n (k) = a n k 2 + b n k + c n , n — 0, 1, . . . , 12, all have positive 
leading coefficients a n ; moreover, p n {k) > for k > 3 and n = 2, 3, . . . , 12, pi(k) = 
±k 2 - ^k + > for k > 4, and p (k) = ±k 2 - |§£; + |g > for k > 8. Using 
this positivity of the polynomials we can continue the inequality in (fl4l for k > 8 
as follows: 



1 - y) k e ky > 1 - 



2 k n k(k-2) 4 fc(5fc-6) = 



k 

2" 3 

12 



30 



% 6 ^(a n /c 2 + 6 n A; + c n ) 



ra=0 



fc n k n k(k-2) A fc(5Jfe-6) = 



1,2 17. 11 

-At fc H 

6 24 20 



from which we deduce the left inequality in (jSJ), and the lemma follows. 
Proof of Theorem [fl The original equation ([I]) is equivalent to 



(15) 
□ 



m— 1 

i=i 



m 



(16) 



Applying to each term on the right-hand side the inequality from (jSJ) we obtain 
k 



So 



g 2 

2m 2 3m 3 



k k{k-2) fc(5fc-6) fc 3 

"3 ~l n ^ '-'4 ~l ™ i ^5 ~ ^ r, -' fi 



5m H 



771—1 



^ ' \ m 

3=1 



k 



2m? 



So 



30m 5 
k 



3m 3 



S 3 + 



6m 6 
k{k-2) 



S 4 + 



5m' 



2m 5 



S, 



5- 



(17) 



with the notation 



m— 1 m— 1 

n -kj/m 



Sn = ^2j"< 



-k/m 



3=1 3=1 

By (j2j) we have e _1 < 2; < e -1 / 2 , where 2 = e _fc//m , and hence 1/(1 — z) < 
1/(1 — e -1 / 2 ) < 3, and in the closed-form expression of the sum 



m—l 



s, 



3=1 

the second term as well as its z -^--derivatives are bounded: 

dz 



< 



1-z 



< 3e~ 



and 



-k/r. 



<3 n+1 m n e~ k , for n = 1,2,... . 



-k/r. 
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Therefore, we can write the inequality in (1171) as 



k 



S '° 2m> S ' 2 3m 3 " 5 



k _, k(k-2) k(5k-6) k 3 f3 3 k 3 4 k 3 7 k 



m—l 



k 



' Si+ 30m 5 ^ 6m^ fi 



2 + 3 + 6 



r — ' V m 

k 

<S °- ^2 S 2 - ^3^3 + V gm 4 ' S '± + ' 



k _, fc(Jfe-2)_, 



S' 



3 + 



3 5 fc(fc - 2) 3 6 A; 2 



implying 

^° 2mi 2 3m 3 " 3 



fc + k(k 2) + fc(5fc ^6) g , _ Je 3 ^, _ 500A; 3 e - fc 



m— 1 



3=1 



5m H 



< S ° 2m^ 2 



30m 5 



6m 6 



k S ' + k{k 2) S' d + -^S' + 500k 2 e- h , 



3m 3 3 8m 4 4 2m 5 5 



(1* 



where 



-k/m 



dz ) 1 — z 



-k/m 



dz J 2 — 1 



for n = 0, 1, . . . ; 



in particular, 

b °~ z-r 



s' 



z + z 2 



, _ z + 4z 2 + z 3 _ z + llz 2 + llz 3 + z 4 

3 ~ (Z-1) 4 ' 4 ~ (^1)5 ! 



z + 2Qz 2 + 6Qz 3 + 26z 4 + z 5 



S' 



z + 57z 2 + 302z 3 + 302^ 4 + 57z 5 + z b 



with z = e k / m . Since 500fc 3 e k < (2k) 3 < m 3 for k > m/2 > 30, using our 
equation (fl6l) we can write the estimates ( TTBl) as 



k(5k-6) k 3 1 



30m 5 



' 6m 6 6 m 3 



< 1 - S' + — ,S' + 



k , A; (A; 2) , A; 



54- 



5 < 



5m 



2m 2 2 3m 3 3 
Noting that e 1 / 2 < z = e fc//m < e, we find 



0<^< e + 26e2 + 66e3 + 26e4 + e5 < in;-. 



-s' + 



4 ~ 4 2m 5 5 ' m 3 ' 



(19) 



( e l/2 _ !)6 



n _, e + 57e 2 + 302e 3 + 302e 4 + 57e 5 + e 6 nrorAA 
< S 6 < < 658544, 



{e 1 / 2 - 1) 
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^ 1 k z + z 2 k z + Az 2 + z 3 



z — 1 2m 2 (z — l) 3 3m 3 (z — 1) 



4 



- 2) z+ llz 2 + llz 3 + z 



110000 



< — , (20) 

mr 



8m 4 (z - l) 5 

where z = e k ' m . 

We already know that k/m is bounded as m —>■ oo; making the ansatz k/m 
c + 0(l/m), hence z = e fc//m = e c + 0(l/m), we find from (|20|) that 

1 ^ 1 \ 

1 = O — as m — ► oo, 

e c — 1 \my 
hence e c = 2 and c = log 2. Now we take 

k i a & ■ > 

— = log 2 H 1 + C I — - I as m — > oo, 

m 

hence 



m m m? \ m 3 



k/m n 2(2 a 2 + 2b ^ 
z = e k/m = 2 H 1 h O 



m m 2 V rn 3 



asm-> oo. 



Substituting these formulas into ( 1201) results in 
Oi— ) =1 ' 



1 + 2a/ m + (a 2 + 26) /m 2 + 0(m- 3 ) 
log 2 + a/m + 0(m~ 2 ) 6 + lOa/m + 0(mT 2 ) 
2m I + 6a/m + 0(m~ 2 ) 

log 2 + Ofm" 1 ) 26 + 0{rrT 1 ) 
3m^ l + 0(m- r ) 



log 2 2 + 0(m" 1 ) 150 + 0(m- 1 ) / 1 \ 
8m 2 l + 0(m~ l ) \m 3 J 

2a + 31og2 3a 2 -3a + 13alog2-26+f log 2 2-f log2 Q ( 1 \ 
m m 2 \m 3 J 

hence a = — flog 2, b = (3 log 2 — ||) log 2 and, finally, we get the asymptotic 
formula (J3j). 

To quantify this asymptotic expansion, we introduce the function 

Jm{ >~\ z-1 2m (z- l) 3 3m 2 (z - l) 4 
A(A - 2/m) z + llz 2 + llz 3 + z^ 
8^ (z - l) 5 

where 

A = A(C) = log2(l- ^--— 2 
V 2m m z 
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agrees with our k/m up to 0(m~ 2 ). Direct computation then shows that 

f m (0) > 0.005m" 2 - 100m" 3 and / m (0.004) < -0.00015m" 2 + 100m" 3 

for m > 100. Therefore, / m (0) > 110000/m 3 for m > 2202 ■ 10 4 and / m (0.004) < 
-110000/m 3 for m > 734 • 10 6 , so that \f m (C)\ < 110000/m 3 is possible only if 
< C < 0.004. Comparing this result with (1201 we conclude that, for k and 
m > 10 9 satisfying (|16|) . we necessarily have 

k i of 1 ^ m 
— = log2 1- 

m \ 2m m z 

with < C m < 0.004. □ 



Clearly, the strategy to deduce further terms in the expansion ([3]) remains the 
same, but in order to achieve precision 0(m" n ) for an integer n > 2 we have to use 
the Taylor expansion of (1 — y) k e ky up to y 2n+1 (each new term in ([3]) requires two 
extra terms in the expansion of (1 — y) k e ky ). In this way we get 

3 /25 2 \ i / 73 61 2 A _ 2 
k = cm c — — c — 3c m + c H c — 25c m 

2 V 12 / V 8 2 / 

/ 41299 657 2 rno 3 1405 A _ 3 

+ cH c 2 - 598c 3 H c 4 m 3 + 0(m 

V 720 2 4 y v 

w 0.69314718m - 1.03972077 - 0.00269758m" 1 + 0.00323260m" 2 

+ 0.00217182m" 3 + 0(m" 4 ), (21) 

where c = log 2. However, we do not possess any clear general strategy to quantify 
such expansions. Already proving a sharp dependence on k for the remainder of 
the n-th truncation of the Taylor expansion of (1 — y) k e ky (like we do for n = 4 in 
Lemma [T]) seems to be a difficult task. We discuss related problems in Section [5J 

Proof of Corollary [TJ Let (m, k) be a non-trivial integer solution of (pQ). By Moser's 

result we know that m > 10 9 . It follows from Theorem [T] that 

2k 0.0111 . . 

0<log2-- -<- — . 22 

2m — 3 (2m — 3) 2 

By Legendre's theorem, | log 2 — p/q\ < l/{2q 2 ) implies that p/q is a convergent 
of log 2, while log 2 > p/q insures that the index of the convergent is even. Thus, 
2k /(2m — 3) is a convergent pjj qj of the continued fraction of log 2 with j even. □ 



3. The proof of the main theorem 

In this section we prove Theorem [2j The restrictions on the prime factorization 
of qj in that result are established using an argument in the style of Moser given in 
the proof of the following lemma. 

Lemma 2. Let (m, k) be a solution of (CQ) with k > 2. Let p be a prime divisor of 
2m — 3. If p — 1 | k, then 

v v (2m - 3) = z/p^" 1 - 1) + u p (k) + 1 > 2. 

If 3 is a primitive root modulo p, then p — 1 | k. 
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Proof. Using that k must be even, we find that 

2m— 4 m—1 m—3 m—1 m—3 

E ^ = E i k + E ( 2m - 3 - ft" = E ^' fc + E ^' fc ( mod 2m - 3 ) 
j=i j=i j=i j=i j=i 

= m k + m k - (m - l) fc - (m - 2) fc = 2(3 fe - l)(m - l) fc (mod 2m - 3), 

where we used that m k = (2m — 3 + m) k = 3 k (m — l) fc (mod 2m — 3) and (m — 2) k = 
(2m — 3 — m + l) fc = (m — l) fc (mod 2m — 3). On applying (j5J) with I = 2m — 3 and 
r = we then obtain that 



2(3 fe - l)(m- l) fc 



E ^( mod1 )- ( 23 ) 



2m — 3 ^— ' p 

p\2m— 3 
p-l|fc 

If p | 2m — 3 and p — 1 | k, the p-order of the right-hand side is —1. The p-order of 
the left-hand side must also be —1, that is, we must have 

z/ p (2m - 3) = v p (3 k - 1) + ku p (m - 1) + 1 = ^(3 P_1 - 1) + u p (k) + 1, 

where we used that m — 1 and 2m — 3 are coprime. Now suppose that p | 2m — 3 
and 3 is a primitive root modulo p (thus p | 3 k — 1 implies p — 1 | k). If p — 1 \ k, 
the p-order of the left-hand side is < —1 and > — 1 on the right-hand side. Thus, 
we infer that p — 1 | k. □ 

This completes the required ingredients needed in order to prove the main result. 

Proof of Theorem^ Since by assumption N \ k, we can write k = Nki and thus 
rewrite fl22l as 

<^ h_ < °- 0111 (24 ) 

2N 2m -3 2N(2m-3) 2 ' v 1 

We infer that k\/(2m — 3) = Pj/gj is a convergent to (log2)/(2iV) with j even. Since 
p | m implies p — 1 \ k (see, e.g., Moree [261 Proposition 9]), we have (6, qj) = 1. We 
rewrite (1211) as 

Q< log2_P 1< 0.0111 



2iV qj " 2Nd 2 qf 
with (i the greatest common divisor of ki and 2m — 3. On the other hand, 

log 2 pj 1 



2N q 3 (a j+ i + 2)q 



2 • 

J 



hence (a J+1 + 2) _1 < 0.0111/(2iV<i 2 ), from which the result follows on also noting 
that 2m — 3 > qj and invoking Lemma [2] (note that if v p (qj) > 1, then v p (qj) = 
v p {2m — 3) — v p (ki)). □ 

To prove that p | m implies p—l\k one uses that k must be even and takes / = m 
in ([5]), showing that ^ p | m p -x\k \ mus t be an integer. Since a sum of reciprocals of 
distinct primes can never be an integer, the result follows. 
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4. Computation of the continued fractions 

of Theorem [2l We recall that we expect 



We make use of conditions 



(b) 



E(log<7j(jv)(£)) ~ f° r a generic £ G [0, 1] satisfying these conditions. Indeed, on 
the basis of theoretical results, heuristics and numerical experiments, we conjecture 
that C\ = QOn 2 . 



N 


]=]{N) 


a j+ i 


qj (rounded down) 


qj mod 6 


p = p(qj) 


1 


642 


764 


2.383153- 10 330 


-1 


149 


2 


664 


1529 


2.383153- 10 330 


-1 


149 


2 2 


1254 


21966 


1.132014- 10 638 


+ 1 


5 


2 3 


1264 


43 933 


1.132014- 10 638 


+ 1 


5 


2 4 


1280 


87866 


1.132014- 10 638 


+ 1 


5 


2 5 


1294 


175 733 


1.132014- 10 638 


+ 1 


5 


2 6 


8 950 


26 416 


3.458446- 10 4 589 


-1 




2 7 


8 926 


52 834 


3.458446- 10 4 589 


-1 




2 8 


119 476 


122 799 


1.374540- 10 61 317 


+ 1 




2 s -3 


119 008 


368 398 


1.374540- 10 61 317 


+ 1 




2 8 -3 2 


139 532 


782152 


9.351282 -10 71 882 


+ 1 


56131 


2 8 -3 3 


6168 634 


1 540 283 


8.22 71 9- 10 3177670 


+ 1 




2 8 -3 4 


22 383 618 


5167079 


5.128265- 10 11538265 


+ 1 


17 


2 8 -3 5 


155 830 946 


31664 035 


2.257099 -10 80 303 211 


-1 




2 8 • 3 5 • 5 


351661538 


85 898 211 


9.729739 -10 181214202 


-1 




2 8 • 3 5 ■ 5 2 


1738154 976 


1 433 700 727 


1.594940 -10 895 721 905 


+ 1 


5 




1977626 256 


853 324 651 


1.196828 -10 1019 133 881 


-1 




2 8 • 3 5 • 5 3 


2 015 279170 


4388 327617 


5.565 1 96 -10 1 038523018 


-1 


19 




3 236170 820 


2 307115 390 


5.42 78 1 5 -10 1 667 658416 


+ 1 




2 8 • 3 5 • 5 4 


2 015 385 392 


21941 638 090 


5.565 1 96 -10 1 038523018 


-1 


19 




3 236 257942 


11535 576 954 


5.42 78 1 5- 10 1 667658416 


+ 1 





Table 1 . Smallest integers j satisfying conditions (a) , (b) and (c) of Theorem [2] 



The computation of (log 2)/(2N) is done in two steps. First, we generate d digits 
of log 2. For this we use the 7-cruncher [21]. With this program, A. Yee and 
R. Chan computed 31 billion decimal digits of log 2 in about 24 hours. Second, we 
set a rational approximation of (log2)/(2iV) with a relative error bounded by 10~ d . 
Then partial quotients of the continued fraction of (log 2) / (2N) are computed: about 
0.97d of them can be evaluated, with safe error control [I] (cf. the result of Lochs 
mentioned in Section [TJ. We maintain a floating point approximation of numbers qj 
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(rounded down) and residues of qj (mod 6) by the formula qi + \ = a>i + iqi + gj_i for 
i > 0, where q = 1 and g_i = 0. 

Table [1] was created with the 'basic method' of jlj for iV < 2 8 • 3 4 . It was fast 
enough to reach the benchmark m > 10 10 in four days with 50 • 10 6 digits of log 2. 
Bit-complexity of this algorithm (or of the indirect or direct methods [1]) is quadratic 
and reaching the m > 10 10l ° milestone would take centuries. 

Some subquadratic GCD algorithms were discovered that have asymptotic run- 
ning time 0(n(logn) 2 loglogn) [21]. A faster version of the program was written: 
this time a recursive HGCD method is applied. It is adapted for computing a con- 
tinued fraction by using Lemma 3 of [1] (which is similar to Algorithm 1.3.13 of [8]) 
for error control. With it the program leaps over 10 1C|8 in just about one hour. Fi- 
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io 9 



nally, the new benchmark m > 

3 • 10 9 digits of log 2, N = 1555200 and condition [(d) 
conditions 



is established in no more than 10 hours with 
the first found solution fits 



but not (d) With N = 7776000, m > 10 



io 9 



is achieved for the 

smallest j. See Table [TJ in the last column, p is a prime such that p G V(N) and 
v p {qj) = 1, that is, such that condition (d) of Theorem [2] is violated. 

Now, computation time is not a problem to achieve the m > 10 lt)10 milestone, 
a few days will be sufficient on a computer with a large amount of memory. We 
remark that the complexity and hardware requirement for computation of the digits 
of log 2, respectively for computation of its continued fraction expansion, are similar. 



5. Miscellaneous 

5.1. The number of distinct prime factors of m — 1. There is a different ap- 
plication of Theorem [3] suggested by the work of Brenton and Vasiliu [5] , to factor- 
ization properties of the number m — 1 coming from a non-trivial solution (m, k) 
of ([1]). A result of Moser [2H] (which can also be deduced from the key identity ([S]), 
cf. the proof of Lemma [2] above) asserts that 

E - + — ~r G Z; ( 25 ) 

^— ' p m — 1 

p\m—l 

in particular, the number m — 1 is square-free. Since the sum of reciprocals of the 
first 58 primes is less than 2, we conclude that either u(m — 1) > 58 or the integer 
in fT25l) is equal to 1. In the latter case, we can apply Curtiss' bound [9] for positive 
integer solutions of Kellogg's equation 




namely, maxj{a;j} < A n — 1, where the Sylvester sequence {A n } n >i = {2, 3, 7, 43, . . . } 
is defined by the recurrence A n = 1 + YYi=i Ai (for some further info, see e.g. Odoni 
[29]). From this result and the estimate A n < (1.066 • IO 13 ) 2 ™ 7 , we infer 

m< (1.066 -lO 13 ) 2 ^" 1 '" 6 , 
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which together with the lower bound on m from Theorem [3] yields u{m — 1) > 33. 
A similar estimate on the basis of another (l25l) -like identity of Moser implies that 
w(m+ 1) > 32. 

5.2. Generalized EM equation. The method we use in Section [2] for deriving the 
asymptotics of k in terms of m works for the more general equation 



l k + 2 k + ■ ■ ■ + (m - l) fe = tm\ 



(26) 



with t G N fixed, as well. Indeed, the coefficients in the Taylor series expansion 

(i - y) k e ky = l - k -y 2 - k -y" + ^-^y 4 + • ■ ■ = f>(% n (27) 



2" 3 
are polynomials satisfying 

9o(k) = 1, gi(k) = 0, and deg k g n (k) 



?1=0 



g n (0) = forn>2; 



the latter follows from raising the series (1 — y)e y = 1 — y 2 /2 — y 3 /3 
power k. In these settings, equation (1261) becomes 



(28) 
to the 



m— 1 



<=a> 



m 

m— 1 



fc m— 1 x 
kj/m 



n=0 v 7 



E ^"( fc ) jn e -jk/m 



m 

n=0 j=l 



since £°l m j" e - J ' fc / m = 0(m n e~ fc )) 



9n(k) A ?ne _ jfc / m = gn(fc) / ^ _d_\ g \ 
2^ m n ^ m" U dzi 1-z) 

n=0 j=l n=0 v - - 



-k/m 



^ m n y 1 \\ dz z-l 

n=0 \\ / / 



gkfm 



hence in the notation X — k/m and x — 1/m we have 

d\ n 1 



t= £ fc G> 



dz 7 2 — 1 



(29) 



Searching A in the form A = Co + c\% + C2X 2 + • • • , we find successively 



Co = C (t) =log( 1 + - ) =lo. 



t + 1 



Cl = - t + - c, 



C 2 = U + C Z - t+- C--U+- C Z + - 
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and so on. Note that c n (—(t + 1)) = (— l) rt+1 c n (t) for n = 0, 1, 2, ... ; this reflects 
the equivalence of equation (1261) and 

l k + 2 k + ■ ■ ■ + (m - l) k + m k = (t + l)m k . (30) 

From this asymptotics we see that 

2k _ e+ ^-2tjc- tl c3 + 4c/3 + / 1 \ 



2m -h 2{2m-t l ) 2 \(2m-t x f 

where t\ — 2t-\-l and c = log(l + l/t). It can be checked that for all positive integers 
t we have the inequality 

4c 

-0.22 < t 3 lC 2 - 2t\c - he 2 + — < 0, 

3 

and hence 2k /(2m — 2t — 1) is a convergent (with even index) of this logarithm 
c = log(l + 1/t) for m large enough. 

5.3. Saddle-point method. A different approach to treat the asymptotic behaviour 
of k in terms of m for k and m satisfying ([T]) (or, more generally, fT2"6T) ) is based on 
the integral representation 

V(h\ rC+ioo 



1+2 + '" + (m - 1) ^/c_ (^T)^ d2 - 

where C is an arbitrary positive real number (cf. [TQl P- 273]). On noting that 

(m-l)z / i _ p z-C 

' 1 + 



e z — 1 1 — e~ c \ e z — 1 

one obtains, on taking C = (k + l)/(m — 1) and after invoking some rather trivial 
estimates, that 

l k + 2 k + --- + (m-l) k = - _ (1 + PkW) , (32) 

with 



V^(fcTl)C7 

\Pk{m)\ < 



0F(A;-i)( e c_i)- 

(This part of the argument is due to Delange; for more details see pIU pp. 273-274].) 
By (T2]), C is bounded and we infer that \pk(m)\ = 0(k^ 1 ^ 2 ) = 0(m -1 / 2 ). On putting 
m k on the left-hand side of (|32l) and using (1 — l/m) m = exp(— 1 + 0(m -1 )), we 
immediately conclude that, as m — > oo, 

— = log2 + 



where the implied constant is absolute. A more elaborate analysis, using the saddle- 
point method, will very likely allow one as many terms in the latter expansion as 
required. 
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5.4. Experimental asymptotics. It is worth mentioning a fast experimental ap- 
proach of doing asymptotics like (12T1) . Given numerically a few hundred terms of 
a sequence s = {s„}„>i that one believes has an asymptotic expansion in inverse 
powers of n, one can try to apply the asymp fe trick, a simple but often powerful 
method to numerically determine the coefficients in the ansatz 

Cl c 2 

Sn ~ C H h —r H . 

n n z 

As a second step one tries to identify the so-found coefficients with (linear combina- 
tions of) known constants. Thus, one arrives at a conjecture that hopefully can be 
turned into a proof. For more details and some 'victories' achieved by the asymp fc 
method, see Griinberg and Moree [T3] . 

D. Zagier has applied this trick to the sequence of k = k(m) obtained from (TjQ) 
on letting m run through the first thousand values. Excellent agreement with our 
theoretical results was obtained in this way. 
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